A Toolkit for ARB to Integrate Custom Databases and Externally Built Phylogenies
Abstract
Researchers are perpetually amassing biological sequence data. The computational approaches that ecologists use to organize these data (e.g., alignment and phylogeny) typically scale nonlinearly in execution time with the size of the dataset. This often creates a bottleneck in processing experimental data, since many molecular studies involve massive datasets. To keep up with experimental data demands, ecologists are forced to choose between continually upgrading expensive in-house computer hardware and outsourcing the most demanding computations to the cloud. Outsourcing is attractive because it is the less expensive option, but it does not necessarily allow direct user interaction with the data for exploratory analysis. Desktop analytical tools such as ARB are indispensable for this purpose, but they do not necessarily offer a convenient way to coordinate and integrate datasets between local and outsourced destinations. Researchers are therefore left with an undesirable tradeoff between computational throughput and analytical capability. To mitigate this tradeoff, we introduce a software package that combines the interactive exploratory tools offered by ARB with the computational throughput of cloud-based resources. Our pipeline serves as middleware between the desktop and the cloud: it allows researchers to build local custom databases containing sequences and metadata from multiple resources, and it provides a method for linking data outsourced for computation back to the local database. A tutorial implementation of the toolkit is provided in the supporting information, S1 Tutorial.
Availability: http://www.ece.drexel.edu/gailr/EESI/tutorial.php
Similar resources
Visualization schemas and a web-based architecture for custom multiple-view visualization of multiple-table databases
Relational databases provide significant flexibility to organize, store, and manipulate an infinite variety of complex data collections. This flexibility is enabled by the concept of relational data schemas, which allow data owners to easily design custom databases according to their unique needs. However, user interfaces and information visualizations for accessing and utilizing databases have...
Tools and Infrastructure for Supporting Enterprise Knowledge Graphs
We demonstrate EKG, a collection of tools and back-end infrastructure for creating custom, domain specific knowledge graphs. The toolkit is geared toward enterprises and government organizations where domain specific knowledge graphs are often not available. During the demo, audience members will be able to ingest their own documents and instantiate their own knowledge graphs and update them in...
Calculation of Positron Distribution in the Presence of a Uniform Magnetic Field for the Improvement of Positron Emission Tomography (PET) Imaging Using GEANT4 Toolkit
Introduction: Range and diffusion of positron-emitting radiopharmaceuticals are important parameters for image resolution in positron emission tomography (PET). In this study, the GEANT4 toolkit was applied to study positron diffusion in soft tissues with and without a magnetic field for six isotopes commonly used in PET imaging: 11C, 13N, 15O, 18F, 68Ga, and 82Rb. Materials and Methods: GEA...
SynchMe -- A Toolkit for Multimedia Synchronization Research
SynchME is a toolkit for multimedia synchronization research. The toolkit provides a consistent low-level base that includes media stream support, clocks, and a constraint mechanism. The low-level base can then be extended to construct higher-level synchronization abstractions like skew control. Once defined, these abstractions can be re-used, integrated, and modified into a wide variety of synch...
Comparison of Exact Analysis and Steplines Approximation for Externally Excited Exponential Transmission Line
In the present paper, the problem of an externally excited exponential transmission line has been solved analytically in the frequency domain using a simple approach. Then the steplines approximation, as a first-order approximation for the problem of externally excited nonuniform transmission lines in general and the exponentially tapered transmission line (ETL) as a special case, has been presented. Finally the tw...
Volume: 10
Publication date: 2015